Search CORE

18 research outputs found

Leveraging Optimal Transport for Enhanced Offline Reinforcement Learning in Surgical Robotic Environments

Author: Kebria Parham M.
Khosravi Abbas
Zare Maryam
Publication venue
Publication date: 12/10/2023
Field of study

Most Reinforcement Learning (RL) methods are traditionally studied in an active learning setting, where agents directly interact with their environments, observe action outcomes, and learn through trial and error. However, allowing partially trained agents to interact with real physical systems poses significant challenges, including high costs, safety risks, and the need for constant supervision. Offline RL addresses these cost and safety concerns by leveraging existing datasets and reducing the need for resource-intensive real-time interactions. Nevertheless, a substantial challenge lies in the demand for these datasets to be meticulously annotated with rewards. In this paper, we introduce Optimal Transport Reward (OTR) labelling, an innovative algorithm designed to assign rewards to offline trajectories, using a small number of high-quality expert demonstrations. The core principle of OTR involves employing Optimal Transport (OT) to calculate an optimal alignment between an unlabeled trajectory from the dataset and an expert demonstration. This alignment yields a similarity measure that is effectively interpreted as a reward signal. An offline RL algorithm can then utilize these reward signals to learn a policy. This approach circumvents the need for handcrafted rewards, unlocking the potential to harness vast datasets for policy learning. Leveraging the SurRoL simulation platform tailored for surgical robot learning, we generate datasets and employ them to train policies using the OTR algorithm. By demonstrating the efficacy of OTR in a different domain, we emphasize its versatility and its potential to expedite RL deployment across a wide range of fields.Comment: Preprin

arXiv.org e-Print Archive

A Survey of Imitation Learning: Algorithms, Recent Developments, and Challenges

Author: Kebria Parham M.
Khosravi Abbas
Nahavandi Saeid
Zare Maryam
Publication venue
Publication date: 05/09/2023
Field of study

In recent years, the development of robotics and artificial intelligence (AI) systems has been nothing short of remarkable. As these systems continue to evolve, they are being utilized in increasingly complex and unstructured environments, such as autonomous driving, aerial robotics, and natural language processing. As a consequence, programming their behaviors manually or defining their behavior through reward functions (as done in reinforcement learning (RL)) has become exceedingly difficult. This is because such environments require a high degree of flexibility and adaptability, making it challenging to specify an optimal set of rules or reward signals that can account for all possible situations. In such environments, learning from an expert's behavior through imitation is often more appealing. This is where imitation learning (IL) comes into play - a process where desired behavior is learned by imitating an expert's behavior, which is provided through demonstrations. This paper aims to provide an introduction to IL and an overview of its underlying assumptions and approaches. It also offers a detailed description of recent advances and emerging areas of research in the field. Additionally, the paper discusses how researchers have addressed common challenges associated with IL and provides potential directions for future research. Overall, the goal of the paper is to provide a comprehensive guide to the growing field of IL in robotics and AI.Comment: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessibl

arXiv.org e-Print Archive

A Review of Machine Learning-based Security in Cloud Computing

Author: Babaei Aptin
Dalvand Mohsen Moradi
Kebria Parham M.
Nahavandi Saeid
Publication venue
Publication date: 09/09/2023
Field of study

Cloud Computing (CC) is revolutionizing the way IT resources are delivered to users, allowing them to access and manage their systems with increased cost-effectiveness and simplified infrastructure. However, with the growth of CC comes a host of security risks, including threats to availability, integrity, and confidentiality. To address these challenges, Machine Learning (ML) is increasingly being used by Cloud Service Providers (CSPs) to reduce the need for human intervention in identifying and resolving security issues. With the ability to analyze vast amounts of data, and make high-accuracy predictions, ML can transform the way CSPs approach security. In this paper, we will explore some of the most recent research in the field of ML-based security in Cloud Computing. We will examine the features and effectiveness of a range of ML algorithms, highlighting their unique strengths and potential limitations. Our goal is to provide a comprehensive overview of the current state of ML in cloud security and to shed light on the exciting possibilities that this emerging field has to offer.Comment: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessibl

arXiv.org e-Print Archive

A Review on Robot Manipulation Methods in Human-Robot Interactions

Author: Kebria Parham M.
Mohamed Shady
Nahavandi Saeid
Yu Samson
Zhang Haoxu
Publication venue
Publication date: 09/09/2023
Field of study

Robot manipulation is an important part of human-robot interaction technology. However, traditional pre-programmed methods can only accomplish simple and repetitive tasks. To enable effective communication between robots and humans, and to predict and adapt to uncertain environments, this paper reviews recent autonomous and adaptive learning in robotic manipulation algorithms. It includes typical applications and challenges of human-robot interaction, fundamental tasks of robot manipulation and one of the most widely used formulations of robot manipulation, Markov Decision Process. Recent research focusing on robot manipulation is mainly based on Reinforcement Learning and Imitation Learning. This review paper shows the importance of Deep Reinforcement Learning, which plays an important role in manipulating robots to complete complex tasks in disturbed and unfamiliar environments. With the introduction of Imitation Learning, it is possible for robot manipulation to get rid of reward function design and achieve a simple, stable and supervised learning process. This paper reviews and compares the main features and popular algorithms for both Reinforcement Learning and Imitation Learning

arXiv.org e-Print Archive

Drivers awareness evaluation using physiological measurement in a driving simulator

Author: Kebria Parham M.
Khosravi Abbas
Koohestani Afsaneh
Nahavandi Saeid
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2019
Field of study

Deakin Research Online

Crossref

Deep imitation learning for autonomous vehicles based on convolutional neural networks

Author: Kebria Parham M.
Khosravi Abbas
Nahavandi Saeid
Salaken Syed Moshfeq
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2020
Field of study

Deakin Research Online

Robust adaptive control scheme for teleoperation systems with delay and uncertainties

Author: Alizadehsani Roohallah
Kebria Parham M.
Khosravi Abbas
Nahavandi Saeid
Shi Peng
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/07/2020
Field of study

Deakin Research Online

Experimental comparison study on joint and cartesian space control schemes for a teleoperation system under time-varying delay

Author: Homaifar Abdollah
Kebria Parham M.
Khosravi Abbas
Nahavandi Saeid
Saif Mehrdad
Publication venue: Scholarship at UWindsor
Publication date: 01/01/2019
Field of study

In this paper, teleoperation control of a UR5 robotic manipulator under time-varying delay is addressed. Two control methods of joint and Cartesian space are investigated for the teleoperation system. The singularity situations of the UR5 robot are considered for the case of different degrees of freedom (DoF), and also different kinematic properties for the master and slave systems. We first focus on a Cartesian space control of a teleoperation system which is common in the literature. It is shown that this control methodology is highly prone to singular situations. To overcome this problem, a Jacobian-based control strategy in joint space is presented. An extensive simulation study demonstrates the superiority and reliability of the later control methodology

Deakin Research Online

Crossref

Scholarship at UWindsor

Type-2 fuzzy neural network synchronization of teleoperation systems with delay and uncertainties

Author: Jalali Seyed Mohammad Jafar
Kebria Parham M.
Khosravi Abbas
Nahavandi Saeid
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2019
Field of study

Deakin Research Online

Crossref